Reading the dataset

## [1] "C:/Users/rbp79/OneDrive/FLORIDAPOLY/DataVisualization/final/dataviz_final_project/project-02"

Visualization of the dataset

Structure of the dataset

## 'data.frame':    20640 obs. of  10 variables:
##  $ longitude         : num  -122 -122 -122 -122 -122 ...
##  $ latitude          : num  37.9 37.9 37.9 37.9 37.9 ...
##  $ housing_median_age: int  41 21 52 52 52 52 52 52 42 52 ...
##  $ total_rooms       : int  880 7099 1467 1274 1627 919 2535 3104 2555 3549 ...
##  $ total_bedrooms    : int  129 1106 190 235 280 213 489 687 665 707 ...
##  $ population        : int  322 2401 496 558 565 413 1094 1157 1206 1551 ...
##  $ households        : int  126 1138 177 219 259 193 514 647 595 714 ...
##  $ median_income     : num  8.33 8.3 7.26 5.64 3.85 ...
##  $ ocean_proximity   : chr  "NEAR BAY" "NEAR BAY" "NEAR BAY" "NEAR BAY" ...
##  $ median_house_value: int  452600 358500 352100 341300 342200 269700 299200 241400 226700 261100 ...
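A minimal sketch of how a structure listing like the one above can be produced. The report does not show its loading code, so the file here is a hypothetical stand-in: two toy rows (values rounded as in the output above) are written to a temporary CSV and read back with `read.csv()`; the name `housing` is an assumption.

```r
# Hypothetical stand-in for the housing CSV: two toy rows written to a temp file.
csv_path <- tempfile(fileext = ".csv")
writeLines(c(
  "longitude,latitude,housing_median_age,ocean_proximity,median_house_value",
  "-122,37.9,41,NEAR BAY,452600",
  "-122,37.9,21,NEAR BAY,358500"
), csv_path)

# Keep ocean_proximity as character, matching the `chr` type in the listing above.
housing <- read.csv(csv_path, stringsAsFactors = FALSE)

# str() prints one line per column: name, type, and the first few values.
str(housing)
```

With the real file, only `csv_path` would change; `str()` and `summary()` work the same way on the full data frame.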

Data Summary

##    longitude         latitude     housing_median_age  total_rooms   
##  Min.   :-124.3   Min.   :32.54   Min.   : 1.00      Min.   :    2  
##  1st Qu.:-121.8   1st Qu.:33.93   1st Qu.:18.00      1st Qu.: 1448  
##  Median :-118.5   Median :34.26   Median :29.00      Median : 2127  
##  Mean   :-119.6   Mean   :35.63   Mean   :28.64      Mean   : 2636  
##  3rd Qu.:-118.0   3rd Qu.:37.71   3rd Qu.:37.00      3rd Qu.: 3148  
##  Max.   :-114.3   Max.   :41.95   Max.   :52.00      Max.   :39320  
##                                                                     
##  total_bedrooms     population      households     median_income    
##  Min.   :   1.0   Min.   :    3   Min.   :   1.0   Min.   : 0.4999  
##  1st Qu.: 296.0   1st Qu.:  787   1st Qu.: 280.0   1st Qu.: 2.5634  
##  Median : 435.0   Median : 1166   Median : 409.0   Median : 3.5348  
##  Mean   : 537.9   Mean   : 1425   Mean   : 499.5   Mean   : 3.8707  
##  3rd Qu.: 647.0   3rd Qu.: 1725   3rd Qu.: 605.0   3rd Qu.: 4.7432  
##  Max.   :6445.0   Max.   :35682   Max.   :6082.0   Max.   :15.0001  
##  NA's   :207                                                        
##  ocean_proximity    median_house_value
##  Length:20640       Min.   : 14999    
##  Class :character   1st Qu.:119600    
##  Mode  :character   Median :179700    
##                     Mean   :206856    
##                     3rd Qu.:264725    
##                     Max.   :500001    
## 
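The summary reports 207 missing values, all in `total_bedrooms`. A short sketch of how missing values can be counted per column and how incomplete rows can be set aside, using a toy data frame (the `NA` is placed by hand for illustration; `toy`, `na_counts`, and `complete_rows` are hypothetical names):

```r
# Toy data: first values of two columns, with one NA inserted by hand.
toy <- data.frame(
  total_rooms    = c(880, 7099, 1467, 1274),
  total_bedrooms = c(129, NA, 190, 235)
)

# Count missing values in each column.
na_counts <- colSums(is.na(toy))
print(na_counts)

# Keep only the rows that have no missing values.
complete_rows <- toy[complete.cases(toy), ]
print(nrow(complete_rows))
```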

Plotting data distribution

| | longitude | latitude | housing_median_age | total_rooms | total_bedrooms | population | households | median_income | median_house_value |
|:--|--:|--:|--:|--:|--:|--:|--:|--:|--:|
| longitude | 1.0000000 | -0.9246644 | -0.1081968 | 0.0445680 | NA | 0.0997732 | 0.0553101 | -0.0151759 | -0.0459666 |
| latitude | -0.9246644 | 1.0000000 | 0.0111727 | -0.0360996 | NA | -0.1087847 | -0.0710354 | -0.0798091 | -0.1441603 |
| housing_median_age | -0.1081968 | 0.0111727 | 1.0000000 | -0.3612622 | NA | -0.2962442 | -0.3029160 | -0.1190340 | 0.1056234 |
| total_rooms | 0.0445680 | -0.0360996 | -0.3612622 | 1.0000000 | NA | 0.8571260 | 0.9184845 | 0.1980496 | 0.1341531 |
| total_bedrooms | NA | NA | NA | NA | 1 | NA | NA | NA | NA |
| population | 0.0997732 | -0.1087847 | -0.2962442 | 0.8571260 | NA | 1.0000000 | 0.9072223 | 0.0048343 | -0.0246497 |
| households | 0.0553101 | -0.0710354 | -0.3029160 | 0.9184845 | NA | 0.9072223 | 1.0000000 | 0.0130331 | 0.0658427 |
| median_income | -0.0151759 | -0.0798091 | -0.1190340 | 0.1980496 | NA | 0.0048343 | 0.0130331 | 1.0000000 | 0.6880752 |
| median_house_value | -0.0459666 | -0.1441603 | 0.1056234 | 0.1341531 | NA | -0.0246497 | 0.0658427 | 0.6880752 | 1.0000000 |
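The `NA` entries for `total_bedrooms` follow from the 207 missing values in that column: by default `cor()` returns `NA` for any pair that involves a missing observation. A minimal sketch of the difference, on toy values with one `NA` inserted by hand (the names `toy`, `cor_default`, and `cor_complete` are hypothetical):

```r
# Toy data: first values of three columns, with one NA inserted by hand.
toy <- data.frame(
  total_rooms    = c(880, 7099, 1467, 1274, 1627),
  total_bedrooms = c(129, 1106, NA, 235, 280),
  households     = c(126, 1138, 177, 219, 259)
)

# Default behaviour: any pair that involves a column with an NA yields NA.
cor_default <- cor(toy)

# Drop rows with missing values before computing the correlations.
cor_complete <- cor(toy, use = "complete.obs")

print(cor_default)
print(cor_complete)
```

`use = "pairwise.complete.obs"` is an alternative that drops rows per pair of columns rather than across the whole data frame; which one is preferable depends on how much data the stricter option discards.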

Spatial Visualization

### Interactive Plot

Map Animation

One-Hot Encoding Ocean Proximity

##                              longitude    latitude housing_median_age
## longitude                  1.000000000 -0.92466443        -0.10819681
## latitude                  -0.924664434  1.00000000         0.01117267
## housing_median_age        -0.108196813  0.01117267         1.00000000
## total_rooms                0.044567978 -0.03609960        -0.36126220
## total_bedrooms                      NA          NA                 NA
## population                 0.099773223 -0.10878475        -0.29624424
## households                 0.055310093 -0.07103543        -0.30291601
## median_income             -0.015175865 -0.07980913        -0.11903399
## median_house_value        -0.045966615 -0.14416028         0.10562341
## ocean_proximityINLAND     -0.055574654  0.35116598        -0.23664459
## ocean_proximityISLAND      0.009445503 -0.01657165         0.01701984
## ocean_proximityNEAR.BAY   -0.474488910  0.35877099         0.25517166
## ocean_proximityNEAR.OCEAN  0.045508838 -0.16081792         0.02162156
##                            total_rooms total_bedrooms   population   households
## longitude                  0.044567978             NA  0.099773223  0.055310093
## latitude                  -0.036099596             NA -0.108784747 -0.071035433
## housing_median_age        -0.361262201             NA -0.296244240 -0.302916009
## total_rooms                1.000000000             NA  0.857125973  0.918484493
## total_bedrooms                      NA              1           NA           NA
## population                 0.857125973             NA  1.000000000  0.907222266
## households                 0.918484493             NA  0.907222266  1.000000000
## median_income              0.198049645             NA  0.004834346  0.013033052
## median_house_value         0.134153114             NA -0.024649679  0.065842651
## ocean_proximityINLAND      0.025624325             NA -0.020732123 -0.039402469
## ocean_proximityISLAND     -0.007571767             NA -0.010412114 -0.009077005
## ocean_proximityNEAR.BAY   -0.023022417             NA -0.060880154 -0.010093339
## ocean_proximityNEAR.OCEAN -0.009175150             NA -0.024263727  0.001714434
##                           median_income median_house_value
## longitude                  -0.015175865        -0.04596662
## latitude                   -0.079809127        -0.14416028
## housing_median_age         -0.119033990         0.10562341
## total_rooms                 0.198049645         0.13415311
## total_bedrooms                       NA                 NA
## population                  0.004834346        -0.02464968
## households                  0.013033052         0.06584265
## median_income               1.000000000         0.68807521
## median_house_value          0.688075208         1.00000000
## ocean_proximityINLAND      -0.237495762        -0.48485933
## ocean_proximityISLAND      -0.009228171         0.02341608
## ocean_proximityNEAR.BAY     0.056196803         0.16028448
## ocean_proximityNEAR.OCEAN   0.027343611         0.14186217
##                           ocean_proximityINLAND ocean_proximityISLAND
## longitude                           -0.05557465           0.009445503
## latitude                             0.35116598          -0.016571648
## housing_median_age                  -0.23664459           0.017019840
## total_rooms                          0.02562432          -0.007571767
## total_bedrooms                               NA                    NA
## population                          -0.02073212          -0.010412114
## households                          -0.03940247          -0.009077005
## median_income                       -0.23749576          -0.009228171
## median_house_value                  -0.48485933           0.023416076
## ocean_proximityINLAND                1.00000000          -0.010614425
## ocean_proximityISLAND               -0.01061443           1.000000000
## ocean_proximityNEAR.BAY             -0.24088703          -0.005498984
## ocean_proximityNEAR.OCEAN           -0.26216349          -0.005984684
##                           ocean_proximityNEAR.BAY ocean_proximityNEAR.OCEAN
## longitude                            -0.474488910               0.045508838
## latitude                              0.358770991              -0.160817925
## housing_median_age                    0.255171663               0.021621556
## total_rooms                          -0.023022417              -0.009175150
## total_bedrooms                                 NA                        NA
## population                           -0.060880154              -0.024263727
## households                           -0.010093339               0.001714434
## median_income                         0.056196803               0.027343611
## median_house_value                    0.160284484               0.141862170
## ocean_proximityINLAND                -0.240887033              -0.262163488
## ocean_proximityISLAND                -0.005498984              -0.005984684
## ocean_proximityNEAR.BAY               1.000000000              -0.135818271
## ocean_proximityNEAR.OCEAN            -0.135818271               1.000000000
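Column names such as `ocean_proximityNEAR.BAY` are what `model.matrix()` followed by `data.frame()` would produce, so one plausible way to build the encoding is sketched below. This is an assumption, not necessarily the code used for the report. The toy rows, the category assigned to each row, and the baseline level `<1H OCEAN` (which does not appear in the output above because the reference level is dropped) are all illustrative.

```r
# Toy data: house values from the structure listing, categories assigned by hand.
toy <- data.frame(
  median_house_value = c(452600, 358500, 352100, 341300, 342200),
  ocean_proximity    = c("NEAR BAY", "<1H OCEAN", "INLAND", "NEAR OCEAN", "ISLAND")
)

# Fix the level order explicitly so the first level is the dropped baseline.
toy$ocean_proximity <- factor(
  toy$ocean_proximity,
  levels = c("<1H OCEAN", "INLAND", "ISLAND", "NEAR BAY", "NEAR OCEAN")
)

# model.matrix() creates one 0/1 column per non-baseline level;
# the first column is the intercept, which is removed here.
dummies <- model.matrix(~ ocean_proximity, data = toy)[, -1, drop = FALSE]

# data.frame() makes the names syntactic: "NEAR BAY" becomes "NEAR.BAY".
enc <- data.frame(toy["median_house_value"], dummies)
print(names(enc))
```

Dropping one level avoids perfect collinearity with the intercept in a linear model; each remaining dummy is then interpreted relative to the baseline category.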

Linear Regression Analysis

## 
## Call:
## lm(formula = median_house_value ~ ., data = enc_data)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -556980  -42683  -10497   28765  779052 
## 
## Coefficients:
##                             Estimate Std. Error t value Pr(>|t|)    
## (Intercept)               -2.270e+06  8.801e+04 -25.791  < 2e-16 ***
## longitude                 -2.681e+04  1.020e+03 -26.296  < 2e-16 ***
## latitude                  -2.548e+04  1.005e+03 -25.363  < 2e-16 ***
## housing_median_age         1.073e+03  4.389e+01  24.439  < 2e-16 ***
## total_rooms               -6.193e+00  7.915e-01  -7.825 5.32e-15 ***
## total_bedrooms             1.006e+02  6.869e+00  14.640  < 2e-16 ***
## population                -3.797e+01  1.076e+00 -35.282  < 2e-16 ***
## households                 4.962e+01  7.451e+00   6.659 2.83e-11 ***
## median_income              3.926e+04  3.380e+02 116.151  < 2e-16 ***
## ocean_proximityINLAND     -3.928e+04  1.744e+03 -22.522  < 2e-16 ***
## ocean_proximityISLAND      1.529e+05  3.074e+04   4.974 6.62e-07 ***
## ocean_proximityNEAR.BAY   -3.954e+03  1.913e+03  -2.067  0.03879 *  
## ocean_proximityNEAR.OCEAN  4.278e+03  1.570e+03   2.726  0.00642 ** 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 68660 on 20420 degrees of freedom
##   (207 observations deleted due to missingness)
## Multiple R-squared:  0.6465, Adjusted R-squared:  0.6463 
## F-statistic:  3112 on 12 and 20420 DF,  p-value: < 2.2e-16
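The degrees of freedom in this output are consistent with the missing values seen earlier: 20640 rows minus 207 dropped rows leaves 20433 observations, and subtracting the 13 estimated coefficients gives 20420 residual degrees of freedom. The sketch below reproduces the same bookkeeping on simulated data; the data, the coefficients used to generate it, and the names `sim`, `fit`, and `fit_summary` are hypothetical.

```r
# Simulated stand-in data, not the housing dataset.
set.seed(1)
n <- 50
sim <- data.frame(median_income = runif(n, 0.5, 15))
sim$median_house_value <- 40000 * sim$median_income + rnorm(n, sd = 50000)

# Introduce three missing predictor values; lm() drops those rows by default.
sim$median_income[1:3] <- NA

# Same formula style as in the report: regress the response on all other columns.
fit <- lm(median_house_value ~ ., data = sim)
fit_summary <- summary(fit)

print(length(fit$na.action))   # rows deleted due to missingness
print(df.residual(fit))        # n - dropped rows - number of coefficients
print(fit_summary$r.squared)   # share of variance explained by the model
```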